Categorization of High-Dimensional Knowledge-Based Representations of File Data Using Abstract Self-Organizing Maps
نویسنده
چکیده
Here a new algorithm based on the Self Organizing Map is presented which allows for easier categorization of textual data. The algorithm is based heavily on the concept of metadata and its extraction and providing an abstract look at that data. The algorithm is titled the Abstract Self Organizing Map for that reason. The paper discusses the current and potential impact of metadata, the current paradigm for file organization, Teuvo Kohonen’s traditional SOM algorithm along with my new adaptation of it. Preliminary results are presented based on organization of real world file data. These results show that the algorithm is in fact effective in these situations and an outlook towards the future based on these results is discussed.
منابع مشابه
Steel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps
Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...
متن کاملGreen Product Consumers Segmentation Using Self-Organizing Maps in Iran
This study aims to segment the market based on demographical, psychological, and behavioral variables, and seeks to investigate their relationship with green consumer behavior. In this research, self-organizing maps are used to segment and to determine the features of green consumer behavior. This was a survey type of research study in which eight variables were selected from the demographical,...
متن کاملSelf-organizing systems for knowledge discovery in large databases
We present a framework in which self-organizing systems can be used to perform change of representation on knowledge discovery problems, to learn from very large databases. Clustering using self-organizing maps is applied to produce multiple, intermediate training targets that are used to define a new supervised learning and mixture estimation problem. The input data is partitioned using a stat...
متن کاملGrowing hierarchical self-organizing map method using category utility
In order to automatically obtain hierarchical knowledge representation from a certain data, an unsupervised learning method has been developed that overcomes two problems of the growing hierarchical self-organizing map (GHSOM) method, which uses the quantization error, the deviation of the input data, as evaluation measure of the growing maps: proper control of the growth process of each map is...
متن کاملA Comparison of Support Vector Machines and Self-Organizing Maps for e-Mail Categorization
This paper reports on experiments in multi-class document categorization with support vector machines and self-organizing maps. A data set consisting of personal e-mail messages is used for the experiments. Two distinct document representation formalisms are employed to characterize these messages, namely a standard word-based approach and a character n-gram document representation. Based on th...
متن کامل